Learning Weighted Association Rules in Human Phenotype Ontology
نویسندگان
چکیده
The Human Phenotype Ontology (HPO) is a structured repository of concepts (HPO Terms) that are associated to one or more diseases. The process of association is referred to as annotation. The relevance and the specificity of both HPO terms and annotations are evaluated by a measure defined as Information Content (IC). The analysis of annotated data is thus an important challenge for bioinformatics. There exist different approaches of analysis. From those, the use of Association Rules (AR) may provide useful knowledge, and it has been used in some applications, e.g. improving the quality of annotations. Nevertheless classical association rules algorithms do not take into account the source of annotation nor the importance yielding to the generation of candidate rules with low IC. This paper presents HPO-Miner (Human Phenotype Ontology-based Weighted Association Rules) a methodology for extracting Weighted Association Rules. HPO-Miner can extract relevant rules from a biological point of view. A case study on using of HPO-Miner on publicly available HPO annotation datasets is used to demonstrate the effectiveness of our
منابع مشابه
Optimal Rule Selection Scheme using Concept Relationship Analysis
In Data Mining, the Association rule mining is used to retrieve the recurrent item sets. Apriori algorithm is mainly used to mine association rules. In that, rule reduction is required for efficient decision-making system. Knowledge based rule reduction schemes are used to filter the interested rules. In the existing system rule validation is not provided. Quantitative attributes are not consid...
متن کاملAswaacc Automatic Semantic Web Annotation by Applying Associative Concept Classifier in Text
After appearance of semantic web, the framework which is machine-readable and machine-understandable, by Berners Lee, current web should be annotated by W3C standards in order to define semantic domain of each word by its ontology to alleviate the posed problems in the realm of search and information retrieval. However annotation is one major problem in the semantic web domain, which is present...
متن کاملIdentifying Human Phenotype Terms by Combining Machine Learning and Validation Rules
Named-Entity Recognition is commonly used to identify biological entities such as proteins, genes, and chemical compounds found in scientific articles. The Human Phenotype Ontology (HPO) is an ontology that provides a standardized vocabulary for phenotypic abnormalities found in human diseases. This article presents the Identifying Human Phenotypes (IHP) system, tuned to recognize HPO entities ...
متن کاملOntology-driven Association Rules Extraction: a Case of Study
This paper proposes an integrated framework for extracting Constraint-based Multi-level Association Rules with an ontology support. The system permits the definition of a set of domain-specific constraints on a specific domain ontology, and to query the ontology for filtering the instances used in the association rule mining process. This method can improve the quality of the extracted associat...
متن کاملMining Weighted Association Rules for Fuzzy Quantitative Items
During the last ten years, data mining, also known as knowledge discovery in databases, has established its position as a prominent and important research area. Mining association rules is one of the important research problems in data mining. Many algorithms have been proposed to nd association rules in large databases containing both categorical and quantitative attributes. We generalize this...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1701.00077 شماره
صفحات -
تاریخ انتشار 2016